Feature/video support in random mm dataset #25963
base: main
Conversation
Signed-off-by: Eugene Khvedchenia <[email protected]>
…hen generating random inputs (This is to avoid inserting mm-related tokens which may confuse VLM models) Signed-off-by: Eugene Khvedchenia <[email protected]>
💡 Codex Review
Here are some automated review suggestions for this pull request.
I see you generate a temporary mp4 file, dump the video into it, read it into bytes, and then send it base64-encoded. I suspect that passing a reference to the temporary file in the payload, rather than base64-encoding it, would speed up inference. By building on top of your code we could easily test this hypothesis with hard facts 👍
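For context, the base64 flow being discussed can be sketched roughly like this. This is a minimal illustration, not the PR's actual helper, and the `frames_writer` callback is a hypothetical stand-in for whatever writes the synthetic mp4:

```python
import base64
import tempfile


def video_to_data_url(frames_writer) -> str:
    """Dump a synthetic video to a temporary mp4, then return a base64
    data URL suitable for an OpenAI-chat multimodal payload.
    (Sketch only; the PR's actual helper may differ.)
    """
    with tempfile.NamedTemporaryFile(suffix=".mp4") as tmp:
        # Hypothetical callback that writes mp4 bytes to the given path.
        frames_writer(tmp.name)
        tmp.seek(0)
        video_bytes = tmp.read()
    b64 = base64.b64encode(video_bytes).decode("ascii")
    return f"data:video/mp4;base64,{b64}"
```

The follow-up idea above would replace the data URL with a file path or URL reference in the payload, avoiding the ~33% size overhead of base64.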
What should my action points be in that regard?
I don't think you need to change this PR to enable the comparison I mentioned. It's only a potential follow-up.
Signed-off-by: Eugene Khvedchenya <[email protected]>
I'm going to turn on the ready label so that you can see whether the benchmark tests pass.
…de specific tokens Signed-off-by: Eugene Khvedchenia <[email protected]>
Allow benchmarking models using the random-mm dataset with video inputs.

Purpose
Can now do this:
vllm bench serve \
  --backend openai-chat --endpoint /v1/chat/completions \
  --dataset-name random-mm --num-prompts 256 \
  --model nvidia/Cosmos-Reason1-7B \
  --max-concurrency 32 \
  --random-prefix-len 0 \
  --random-input-len 30 \
  --random-output-len 128 \
  --random-mm-base-items-per-request 1 \
  --random-mm-num-mm-items-range-ratio 0 \
  --random-mm-bucket-config '{(512, 512, 16): 1.0}' \
  --request-rate inf \
  --ignore-eos \
  --seed 42
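As an aside, the `--random-mm-bucket-config` value is a Python-literal dict. A hedged sketch of how such a string could be parsed follows; the benchmark's real parser may differ, and the reading of the tuple as (height, width, num_frames) mapped to a sampling probability is an assumption:

```python
import ast


def parse_bucket_config(s: str) -> dict:
    """Parse a bucket-config string like '{(512, 512, 16): 1.0}' into a
    dict mapping (height, width, num_frames) -> sampling probability.
    (Sketch only; the field interpretation is an assumption.)
    """
    cfg = ast.literal_eval(s)
    if not isinstance(cfg, dict):
        raise ValueError("bucket config must be a dict literal")
    total = sum(cfg.values())
    # Probabilities across buckets should form a distribution.
    if abs(total - 1.0) > 1e-6:
        raise ValueError(f"bucket probabilities must sum to 1, got {total}")
    return cfg
```

With the single bucket above, every request gets one 512x512, 16-frame video item.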